Conversation

@Nancheng-11
Collaborator

feature - add fmha ut & fix build

feature - add torch mla in pymodel

fix - align deepseekv2 output using hack layer!!

fix - align deepseek v2 output using lite-chat

feature - support prefill & decode mla cpp ops

refactor - mv flashinfer mla ops to fmha.py

fix - add deps in BUILD

@CLAassistant

CLAassistant commented Oct 10, 2025

All committers have signed the CLA.

@Nancheng-11 force-pushed the feature/pymodel_deepseek branch from 1074bfb to bf3806f, October 10, 2025 08:16
@LLLLKKKK
Collaborator

This needs a smoke test.

@Nancheng-11 force-pushed the feature/pymodel_deepseek branch from bf3806f to f0b8cc7, October 10, 2025 12:39
@Nancheng-11
Collaborator Author

This needs a smoke test.

The smoke test and some image dependency packages will be submitted together to the main-internal branch later.

@Nancheng-11 force-pushed the feature/pymodel_deepseek branch from 04eeba7 to 51212f5, October 11, 2025 13:23
@LLLLKKKK
Collaborator

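This needs a smoke test.

The smoke test and some image dependency packages will be submitted together to the main-internal branch later.

Submit it to the open_merge branch so that CI runs on everything together.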

@Nancheng-11 force-pushed the feature/pymodel_deepseek branch 10 times, most recently from c12c735 to f860657, October 16, 2025 06:29
@LLLLKKKK enabled auto-merge (rebase), October 16, 2025 07:25
@Nancheng-11 force-pushed the feature/pymodel_deepseek branch 17 times, most recently from 6c7a53d to 35dc90d, October 21, 2025 11:21
@Nancheng-11 force-pushed the feature/pymodel_deepseek branch from 35dc90d to 9aa750c, October 21, 2025 12:01
@LLLLKKKK merged commit 71c2807 into alibaba:main, Oct 23, 2025
2 of 4 checks passed

5 participants